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Abstract. Generalizing the notion of split graphs to uniform hypergraphs, 
we prove that the class of these hypergraphs can be characterized by a finite 
list of excluded induced subhypergraphs. We show that a characterization by 
generalized degree sequences is impossible, unlike in the well-known case of 
split graphs. We also give an algorithm to decide whether a given uniform 
hypergraph is a split hypergraph. If it is, the algorithm gives a splitting of it; 
the running time is 0(N \og N). These answer questions of Sloan, Gy. Turan 
and Peled. 



§1. Introduction, split graphs. 

We call a graph G split if there exists a partition A U B of its vertex set so that there 
is no edge between any two points of A and there is an edge between every two points of 
B. Such a partition A U B of V(G) is called a splitting. 

The following theorem describes split graphs in two different ways: 

THEOREM 1.1. For a graph G the following are equivalent: 

(1) G is split. 

(2) G does not contain C±, C§ or K% U K2 as an induced subgraph. 

(3) There is an m such that YmLi di —m(m— 1) = J27=m+i di> where d\ > di > . . . > 
d n are the degrees in G. 

The equivalence of (1) and (2) is first shown in [5], while the equivalence of (1) and 
(3) is in [6]. See also [1] or [4] for history and further references. 

The generalizations of many graph theoretical questions to hypergraphs have practical 
significance. Most frequently, posing these problems for hypergraphs makes them rather 
difficult. For example, testing the existence of a perfect matching can be solved by a well- 
known polynomial algorithm for graphs. Generalizing it to 3-uniform hypergraphs we get 
an NP-complete problem ([7], [3]). 
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The class of line-graphs can be characterized by a finite list of excluded induced 
subgraphs — on the other hand there are infinitely many pairwise nonisomorphic graphs 
with the property that none of them is the line-graph of any 3-uniform hypergraph, but 
every proper subgraph of any of them is, [8]. Studying extremal questions for graphs 
turned out to be very fruitful. The generalization of this theory to hypergraphs comes up 
against serious difficulties. 

Recent research in learning theory has led to a generalization of the notion of split- 
graphs, [9]. Call a /c-uniform hypergraph G a split-hypergraph if its vertices can be parti- 
tioned into two classes so that every /c-tuple in one of the classes is an edge and no /c-tuple 
in the other class is an edge in G. We call such a partition a splitting. Note that our defi- 
nition allows one of the classes to have size smaller than k, in which case the requirement 
on that class automatically holds. 

When G is not a split hypergraph, we say that it is not split or non-split. For a 
hypergraph G, V(G) will denote its vertex set and E(G) its edge set. From now on, n 
denotes \V(G)\ and N is \V(G)\ + \E(G)\. 

The question arose naturally: can we state something similar to Theorem 1.1 for the 
case of split hypergraphs? U. N. Peled asked whether the class of /c-uniform split hyper- 
graphs can be characterized by a finite list of excluded induced subhypergraphs. We give 
an affirmative answer to this question in the next section. Section 3 gives examples of min- 
imal non-split hypergraphs in the 3-uniform case. We did not strive for the completeness 
of the list. Nevertheless, it may show that the number of excluded induced subhypergraphs 
increases considerably as we go from graphs to 3-uniform hypergraphs. In Section 4 we 
examine if the connection between degree sequences and the split property of graphs can 
be generalized. We find a negative answer. 

In their paper [9], where they apply split hypergraphs in learning theory, R. H. Sloan 
and Gy. Turdn gave an algorithm that decides if a /c-uniform hypergraph is split and 
gives a splitting if possible. The running time of their algorithm is 0(n 2k ); they men- 
tion that finding a more efficient algorithm is an open question. Feder, Hell, Klein and 
Motwani introduce the notion of a sparse and dense partition of a graph, which is a kind 
of generalization of a splitting. Their definition can be extended to hypergraphs and the 
algorithm they present (in Theorem 3.1) can be used to find a splitting (or all splittings) of 
a uniform hypergraph. The running time of their algorithm in this context is the same as 
the time requirement of the one given by Sloan and Turan. In the last section we give an 
0(N log N) algorithm for the problem. Our method can also be used to give an algorithm 
for sparse and dense partitions that is faster than the one in [2] in certain cases. 

The restriction of an edge to an X C V(G) is its intersection with X. Minimality 
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of a graph with respect to a certain property will always mean that no induced proper 
subgraph of it has the property. As usual, XAY will stand for the symmetric difference 
between sets X and Y. When x G V{G), we sometimes simply use x instead of {x} in set 
operations. 

§2. The existence of a finite characterization. 

In this section we prove our main theorem. First we state two lemmas. The following 
lemma holds for hypergraphs with edges of sizes at most k, which is a relaxation of k- 
uniformity, and is convenient for our inductive proof. 

Lemma 2.1. Let k and m be positive integers and Y a hypergraph with edges of size at most 
k. Then Y has at most k m different minimal cover sets of at most m points. Furthermore, 
there is an algorithm of time requirement 0(\V(T)\ + \E(T)\) that outputs a list of these 
minimal cover sets. 

Because of the short running time of our algorithm, some more details about the 
implementation could be added. However, designing data structure and the implementa- 
tion of the algorithm does not match the texture of our paper, so we leave these to the 
interested reader. 

Proof. We prove the first part of the claim by induction on m. The case m = 1 is obvious. 
Suppose that for m < r the statement is true, and let Y be any hypergraph with edges of 
size at most k. Let H be an arbitrary edge. Any minimal cover set of Y with at most m 
points consists of a point x of H and a minimal cover set of r|y(p)\{x} with at most m — 1 
elements. Applying the induction hypothesis to the latter one and noting that x could be 
chosen in k different ways, we get the statement for m = r. 

The algorithm will imitate the proof in the previous paragraph. For i = 1, . . . ,m, 
pick the first edge that has not been deleted yet, a vertex Vi of e^, delete the edges that 
contain Vi, and let i := i + 1. The output is {y\, . . . , v m }. Since the number of minimal 
cover sets is bounded by a constant, we can list all of them by systematically going through 
all possible ways of picking Vi in the sequence of e*, and still use only linear time. I 

In what follows, for the sake of simplicity, the edges of the /c-uniform hypergraph G 
will be referred to as type edges, and the /c-tuples not contained in E(G) as type 1 edges. 
(Thus we identify G with a labeling of the edges of a complete /c-uniform hypergraph by 
numbers and 1.) 

Consider some minimal non-split hypergraph G. Minimality implies that for any 
x G G there is a splitting (Aq,A*) of G \ x, i.e. Aq induces only edges of type 0, Af 
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induces only edges of type 1. (A class that only contains edges of type i will sometimes be 
called a type-i- class.) 

Lemma 2.2. Fix a splitting (A%,A\) of G \ v for every v G V(G). If x,y G V(G) are 
vertices in a minimal non-split hypergraph G then \A* n A\_^ < k and \AfAAf\ < 2k for 
i = 0,1. 

Proof. The first inequality follows by the fact that a /c-tuple cannot be contained in the 
type i class of a splitting (of G\x) and a type 1 — i class of a splitting (of G\y) at the 
same time. For the second inequality, use the facts 

Af\^c^fnK_ i u{y}), 

and \Af n (A\_ i U {y})\ < k by the first inequality; similarly with x and j/ interchanged. I 

In the proof we shall consider sets subindexed by or 1. Such a set, of subindex i, 
will always be a class of edges of type i, or some "preliminary set" in constructing a class 
only with edges of type i. This notation will provide us with the following convenience in 
terminology. Given some set S C V(G) by a name subindexed by i (i = 0, 1), let the bad 
edges of S be the edges of type 1 — i on S. 

Theorem 2.3. If a k-uniform non-split hypergraph has more than 4(k + i)/ c 4fc ( fc + 1 )+ 1 vtr _ 
tices then it has a proper induced subhypergraph that is non-split. 

Fix k. We shall give an algorithm. Its brief description is given in the next two 
paragraphs. The input is G, x, (Aq, A*), a, /3, where G is a minimal non-split hypergraph, 
x G V(G), (Ag, Af) is a splitting of G \ x, a G {1, . . . , 4fc}, G {1, . . . , k + 1}. The 
algorithm either stops without any output, or outputs a splitting of G\y for some y G V(G). 
It has a randomized part, to simplify description. 

At the beginning our algorithm adds x to Aq, creating some type 1 edges (bad edges) 
in So := Aq U x. There is another class B 1: set to be A\ at the start. Then we choose a 
random minimal cover set of size at most k + 1 for the bad edges, and move it over from 
So to Bi. In general, at each step at most one of So and B\ contains any bad edges, 
depending on the parity of the step. The algorithm always chooses uniformly a random 
minimal cover set of size at most k + 1 for these bad edges, and consisting of points that 
have not been moved between the two classes yet, and any such set is chosen with positive 
probability. (If there is no such set, then the algorithm fails.) Move this minimal cover set 
to the other class. When we arrive at the ct'th step, let the /3'th element of the minimal 
cover set be called y. (We may assume that there is some fixed ordering on any subset of 
the vertices, for example the one given by the listing of the vertices in the input.) Remove 
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y from the minimal cover set (and hence from both classes in the future), and continue 
the procedure. We run the algorithm for at most Ak steps. With positive probability, at 
some step, (B , B\) will be a splitting of G \ y, and this is the output. 

The inputs a and /3, together with the choices that the random part of the algorithm 
makes, determine y. Using Lemma 2.1, we shall give a bound on the number of possible 
choices, and finally get the general upper bound in Theorem 2.3 for the number of vertices 
in G. 

Here is a formal presentation of the algorithm. 
Algorithm 1 

FUNCTION BadEdges(z, BO returns the list of type \-i (bad) edges of B { . 

FUNCTION MinCoverSet (R, L) returns a uniformly chosen random minimum cover 
set of R of size at most k+1 that is disjoint from L. If there is no such 
set then the algorithm ends and outputs ' 'Unsuccessful ; ; . 

Start of Algorithm 

INPUT: a /c-uniform hypergraph G, vertex x of G, a splitting (Aq,A^) of 
G\ x, numbers a G {1, . . . ,Ak} and (3 G {1, . . . , k + 1} . 

B :=A%U{x}, B 1 :=Af, L := , i:=0, j:=l, £ := 
while j < Ak do { 
R :=BadEdges(z, BO 

if R = then output (So, BO , end of algorithm 

£ :=MinCoverSet( J R, L) 

B l := B t \£, L:=LU£ 

if j = a then { if \£\ < (3 then output ( ' 'Unsuccessful ; ; ) , end of al- 
gorithm, 

else £:=£\{y}, where y is the /3'th element of £ } 

B 1 _ l :=B 1 _ i U£ 

i := 1 - i, j := j + 1 

} 

output ( f 'Unsuccessful' ; ) , end of algorithm 

Lemma 2.4. FixG,x and (A^Af) in the input. For any y G V(G), Algorithm 1 outputs 
a splitting of G\y with positive probability for some values of a, f3. 

Proof. Fix y, and a splitting (Aq, A\) of G \ y. None of the bad edges in Aq U {x} can 
be contained in A V Q , so there exists some minimal cover set of these bad edges that is 
contained in A\ U {y}. (Such a minimal cover set is chosen by MinCoverSet with positive 
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probability.) This minimal cover set cannot have more than k + 1 elements, because then 
a /c-tuple would be present in both Aq and A\, contradicting Lemma 2.2. Since the points 
that we moved are in A\ U {y} by our choice, they get to their final class by the move, so 
we may assume that they are not moved in later steps of the algorithm. Hence, when the 
algorithm chooses a minimal cover set of the bad edges of size at most k + 1 and moves 
this set from Aq U {x} to Af , (call the resulting sets Co and C\ respectively), we have 
|C AA^| < |AqAA^|, and also |CiAA^| < \A\ AA\\. These inequalities are crucial in that 
we succeed after at most 4k repetitions. When y is in £, if a is equal to the actual j and 
y is the /3'th element in £, then y is removed from £, and its removal ensures that the 
remaining graph has a splitting. (This possible removal causes the second inequality above 
to be not necessarily strict.) 

Similarly, denote by D and Di the sets So and B\ respectively at some step, and 
let Co and C\ be So and B\ respectively in the next step. Let I be the set that was 
moved, as in the algorithm; we may assume by symmetry that it was moved from Dq to 
Di. So, C = D \£ and C\ = D\ U {£ \ y). There were some bad edges in Dq, because the 
algorithm did not stop. None of these bad edges can be contained in Aq, so there exists 
some minimal cover set of the bad edges that is contained in Af. We may assume that the 
elements of this minimal cover set are points that have not been moved at any previous 
step (i.e., they are not in L), because we are considering the outcome of the algorithm (of 
positive probability), when the moved vertices must reach the type of class that contains 
them in (Aq,A\) by the move. (The only vertex not in any of Aq or A\ is y, but it is 
removed from £ if a and /3 are suitable.) The algorithm chooses such a minimal cover set 
£ with positive probability, and then we have 

\C AA y \ < \D AA y \ and \dAA\\ < |£>iAA?|. 

We conclude that |SoAAq| + \B\AA\ | decreases in each iteration step by at least 1 
with positive probability. At some step y has to be contained in £, since C itself is not 
split. If a and /3 are suitable, then y is removed from £ (and the future B U Si). Then 
the algorithm can stop for two reasons. The first one is when it outputs "Unsuccessful" 
because of the random choices that MinCoverSet made. However, we have seen that for a 
certain sequence of the random choices that MinCoverSet makes along the iteration steps, 
there always exists a minimal cover set of the bad edges consisting of points not moved 
yet, as long as the set of the bad edges is nonempty. On the other hand, when the set of 
bad edges becomes empty, the actual B and Bi is a splitting. This either happens when 
the algorithm has already run for 4k steps (j > 4k), in which case |S AAq| + |SiA^| has 
to be (at the first step \B AA V q\ + \BtAA\l = \A^AA y \ + \AfAA\\ < 4k, by Lemma 2.2), 
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and we achieved the splitting (A^,A\) as an output. Or there are no bad edges at some 
earlier step. In that case the actual (B 0: Bi) does not necessarily coincide with the fixed 
splitting (Aq, A\), but it is still a splitting of G \ y. I 

Proof of Theorem 2.3. Fix again G, x and (AQ,Af). According to Lemma 2.4, for every 
y G V(G) there is some input a, (3 such that with positive probability Algorithm 1 gives 
a splitting of G \ y. The corresponding sequence of £'s (along the iteration steps of the 
algorithm) and the a, j3 determine the output and y. There are at most k k+1 choices for 
£ in each step, by Lemma 2.1, and the rest of the algorithm is deterministic. So there 
are at most / c 4fc ( fc + 1 ) possible sequences of ts along the iteration steps of the algorithm. 
There are 4k(k + 1) possible inputs a, /?. Hence there are at most k 4h< - k+1 ^4k possible y's, 
which is our upper bound for the number of vertices in a minimal non-split /c-uniform 
hypergraph. I 

A straightforward corollary is our main theorem: 

Theorem 2.5. The family of k-uniform split hypergraphs can be characterized by a finite 
set of forbidden induced subhypergraphs . 



§3. 3-uniform minimal non-split hypergraphs. 

Although Theorem 2.3 gave a very rough upper bound on the size of minimal non- 
split hypergraphs, in this section we illustrate with a few examples that the list of excluded 
induced subhypergraphs increases considerably compared to the case of graphs. 

For \ V{G) | = 6, G is non-split if and only if for any edge its complement is also an edge 
and if the subhypergraph induced by any 4 points is neither the empty nor the complete 
3-uniform hypergraph. It is clear that these are minimal non-split hypergraphs too. 

We get examples of minimal non-split hypergraphs for the case of |V(G)| = 7, 8, 9 in 
the following manner. Write the vertices along a circle and let the edges be exactly those 
3-tuples that consist of three consecutive nodes along the circle. G is not split, since we 
could put into the type-O-class at most 3 vertices, but then the rest contains a type edge 
(and similarly if we put only 2 arbitrary points in the type-O-class). However, dropping 
any vertex, two suitably chosen points can cut up the remaining arc to parts with at most 
two vertices. In the case of 8 (respectively 9) vertices, we can erase 1 (respectively 1,2,3 or 
4) edges from the hypergraph just described, and similarly, we still get a minimal non-split 
hypergraph. The argument is nearly the same for these new graphs. 

We have found minimal 3-uniform non-split hypergraphs on vertex sets of 11 and 12 
elements, but the details are lengthy. The examples above may show that even in the 
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3-uniform case there are more than a hundred (and probably even much more) minimal 
excluded subhypergraphs. 

§4. Degree sequences of hypergraphs. 

The question arises, whether a characterization like that in part (iii) of Theorem 1.1 
can be given for split hypergraphs. Since the equivalence of (iii) and (i) provides us with an 
0(n) algorithm to decide if a graph is split or not, a characterization for split hypergraphs 
using degree sequences could be promising. 

Proposition 4.1. There are 3-uniform hypergraphs G and G' on the vertex set {1, . . . , n} 
such that G is split, G' is non-split, moreover di = d\ and dij = 5'^ for any i,j G 
{1,2, ... ,n}. Here di (d[) stands for the degree of i in G (G'); (5'-) is the number 
of edges containing both i and j in G (G' ). 

Proof. Call a 6-element subset of the vertices of a 3-uniform hypergraph H exchangeable 
if there is a list a\, a 2 , as, a_i, a_2, a_3 of these vertices so that {a\, a2, as}, {a,\, a_2, as}, 
{a_i, 02, 0-3}, {a_i, a_2, as} are edges of H, and the other 3-tuples not containing ai and 
a-i together for some i are not in H. We say that we exchange the exchangeable point 
set {a\, a2,as,a-i, a_2, a-s} when we erase the four edges described above and add the 3- 
tuples {a_i, a_2, a_3}, {a_i, a 2 , as}, \a\, a_2, 03}, {a\, a 2 , a~s} to E(H). We may think of 
the exchangeable 6 vertices as the nodes of an octahedron. Then being exchangeable means 
that, if the nodes were labeled appropriately, the four 3-tuples in E(H) are determined 
by four faces whose pairwise intersection is one point. Exchanging means that we replace 
these 3-tuples in E(H) by the other four faces of the octahedron. One can easily see that 
these are well-defined, the listing of the 6 vertices is essentially unique. On the other hand 
the degrees and the number of edges incident to two points does not change after applying 
an exchanging. Thus if we get a non-split hypergraph from a split hypergraph after serial 
exchanging, then the proposition follows. 

Now, denote X = {1, 2, 3, 4}, Y = {5, 6, 7, 8}. Define G on the vertex set {1, . . . , 14} 
as follows. Every 3-tuple of X U {9, 10, 11} is red, every 3-tuple of Y U {12, 13, 14} is blue, 
and if a 3-tuple intersects both X and {12, 13, 14} then it is blue, if a 3-tuple intersects 
both Y and {9, 10, 11} then it is red. Moreover, the 3-tuples of {9, 10, ... , 14} that have 
not been defined yet get colors so that {9, 10, ... , 14} give an exchangeable set of points, 
as in the first sentence of this proof with (a\, a 2 , as, a_i, a_ 2 , a_ 3 ) = (9, 10, 11, 12, 13, 14). 
The colors of the remaining 3-tuples are arbitrary. G is obviously split with color classes 
X U {9, 10, 11} as red and Y U {12, 13, 14} as blue. Let G' be the hypergraph obtained 
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from G by exchanging {9, . . . , 14}. In G', {9, 10, 11} has become blue, so at least one of 
its points must be put to the other class. But there this point gives a red edge with any 
two points of Y. Since at least two points of Y must remain in the blue class, we conclude 
that G' is indeed non-split. I 

For an arbitrary k, one can give a /c-uniform counterexample with a construction 
similar to the one above, using the /c-dimensional cross-polytope {x G M. k : \\x\\i < 1} 
(the "octahedron" of dimension k). 



§5. Algorithm for splitting. 

Theorem 5.1. For an arbitrary, but fixed k there is an algorithm that decides if a G k- 
uniform hypergraph is a split hypergraph, and gives a splitting when it is. The running 
time is 0(N log N) (where we defined N = \V(G)\ + \E(G)\). 

As in the case of Lemma 2.1, we mention that the running time is understood with ap- 
propriately chosen data structure. Let us also point out that the constant in the 0(N log N) 
bound is exponential in k. 

The following lemma is used repeatedly, to obtain splittings of larger and larger sub- 
hypergraphs of G. 

Lemma 5.2. Let H be a k-uniform hypergraph, and (V 1 ,^ 2 ) be a partition of V(H). 
Denote by H 1 and H 2 the subhypergraphs of H induced by V 1 and V 2 respectively. Suppose 
that (Aq, A\) is a splitting of H and (Aq, A\) is a splitting of W (i = 1, 2). Then |A,-\(A}u 
A 2 j )\<2k-2,j = 0,l. 

Proof. Otherwise some /c-tuple in Aj would be present in A\_j or A\_- (j e {0, 1}), 
contradicting the fact that both are monochromatic of different types. I 

We present an algorithm whose existence proves Theorem 5.1. Call it Algorithm 2. 
We do not give the pidgin Pascal program this time, because the main structure of the 
algorithm is very simple, and the more particular elements are similar to Algorithm 1 
(except that the random part is replaced by a deterministic choice, as we shall see). Let 
me start with an overview of the algorithm. We use notation (m) for the modulo 2 value 
of an integer m. 

The crucial component of our algorithm is a subroutine called Tree. Its input is 
(if, X , Xl), where if is a /c-uniform hypergraph, and (X , Xi) partitions V(H). Tree will 
create labels for the vertices of a rooted tree T of depth 2k — 2, where each inner node has 
k 2k ~ 2 children. For each vertex, we assume that there is some ordering on its children, 
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so we can talk about the "m'th child" (when m < k 2k ~ 2 ). Similarly, we assume that for 
any set of /c-tuples of G, there is some fixed ordering on the minimal cover sets of size at 
most 2k — 2 of these /c-tuples, so we can talk about the "m'th minimal cover set" (when 
m is small enough). Note that such an ordering of the minimal cover sets can be easily 
implemented by an algorithm that is linear in the input size, using Lemma 2.1. Finally, 
fix some ordering on the vertices of T, so that vertices at smaller depth precede vertices 
at larger depth. 

Say that the root has depth 0, its children have depth i, etc. The root r will have label 
(Aq, A[) := (A , Ai). If Xq contains some edge of type 1 in G, let s := 0. Otherwise let 
s := 1. 

Suppose that v is a vertex in T whose parent is u, and v is at depth g. Assume 
that v is the m'th child of u. Let v get label (£ v ,Xq,X\), where £ v is the m'th minimal 
cover set of size at most 2k — 2 of the bad edges in X™ g+s y let X^ g+s ^ := X™ g+s ^ \ £ v and 
X^ 1+g+s ^ = X™ 1+g+s j U £ v . If m is such that there is no m'th minimal cover set of size at 
most 2k — 2 for the actual bad edges, then delete v and all its offspring from T. If there 
are no bad edges in X g+S , then Tree outputs the first such (Xq,X^) (in the ordering of 
the vertices), which is a splitting of the input graph H. If there are bad edges for all the 
leaves, then Tree outputs "No" , indicating H is not split. 

Algorithm 2 receives input G, a /c-uniform hypergraph. In the preparational step 
it partitions V(G) to sets A±, . . . , A^ n / k _ 1 -] [ , where each Ai has k — 1 elements with the 
possible exception of ^4[ n /fe-i]- Then we call the iteration part of the algorithm, with 
input ((A 1 ,0),...,(A ln/k _ 1] ,0)). 

In each cycle of the iteration part of Algorithm 2 there is an incoming list ((Aq , A| ), 
(A 2 , X?),..., (AT, Af)) , where (A*, X\) is a splitting of G\ x < uX{ , and the (A* U Aj)- 1 
give a partition of V(G). Now form pairs (Y^Y{) := (A^" 1 U Xq\ X 2i ~ l U Xf) as % = 
1, . . . , [m/2j ; if m is odd then define (Y^ m/i] , y/ m / 2 ^ ) : = (X™, A x m ). Now, call Tree for 
each i e {1, . . . , [m/2] }, with input (Yg , Y{). If any of these returns "No" , then Algorithm 
2 ends and outputs U G has no splitting". Otherwise, if Tree returns (A l , A\) at the i'th 
call then the input for the next iteration step is ((Aq, A{), (Aq, Af), . . . , (A^™^ 2 ^ , A[ m ^ 2 ^ )) . 
The last cycle of the iteration is when m = 2, that is, when we only call Tree once. Suppose 
that then it returns the pair (Aq, A\). In this case Algorithm 2 outputs this pair, which 
is a splitting of G. 

Lemma 5.3. Algorithm 2 provides a splitting of G if there is any, and answers "G has 
no splitting" if G is non-split. 

Proof. We shall prove that Tree indeed decides if there is a splitting of the hypergraph 
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that it receives while Algorithm 2 is running and outputs a splitting if there is one. (Note 
that the hypergraphs that Tree receives in Algorithm 2 have a very specific form.) Then 
it is clear that the iteration steps give splittings for families of bigger and bigger graphs 
and finally for G, if G is split. So, as in Lemma 5.2, let H be a /c-uniform hypergraph, 
and (V 1 ^ 2 ) be a partition of V(H). Denote by H 1 and H 2 the subhypergraphs of H 
induced by V 1 and V 2 respectively. Suppose that (A l 0: A\) is a splitting of H l (i = 1,2). 
We need to show that if the input is (H : Aq U Aq, A\ U A\) then Tree finds a splitting of 
H if there is any, and answers "No" if H is not split. Note that the inputs that Tree can 
get in Algorithm 2 indeed have this special form. 

Suppose that H has a splitting, fix one such (A , Ai). The process of finding a splitting 
of H corresponds to a walk on T, starting from the root and going in each step towards a 
child. Each step of the walk corresponds to finding a minimal cover set for the bad edges 
in the class being examined actually, and putting this set over to the other class (starting 
with the two classes from the input of Tree). By the same argument as in Lemma 2.4 
(using the analogue Lemma 5.2 of Lemma 2.2 now), we shall conclude that if there is a 
splitting, we get it in < 2k — 2 steps, or equivalently, at the latest when arriving to a leaf 
of T. 

To be more detailed, suppose that (Aq U Aq 7 A\ U A 2 ) —: (£? , Bi) is not a splitting yet 
(otherwise we are done, the algorithm outputs the splitting after making the label for the 
root). Then there are type 1 edges in B or there are type edges in Bi (call any of these 
a bad edge). We may assume by symmetry that Bq has bad edges. Then some minimal 
cover set of these bad edges is contained in A\, and the root has a child v such that £ v 
is this minimal cover set . The iteration part of Algorithm 2 moves over £ v from B to 
B u hence creating X% and Xf. We have \X%AA \ + \XfAAx\ < \B AA \ + |£iAAi|, 
similarly to the displayed line in the proof of Lemma 2.4. Note that after the first step of 
the walk on T only one of Xq and X% can have bad edges. 

Proceed similarly: when the walk in T is in u and the bad edges are in Xf (i G {0, 1}), 
there is a minimal cover set for these bad edges that is contained in Ai- i: and such that it 
is disjoint from \J£ W: where w ranges through the ancestors of u. (This latter assumption 
can be made because in each step we moved points that got to their "final class" in (A , Ax) 
by this move.) Some child v of u is such that this minimal cover set is £ v , and for the 
arising (Xq,X^) we again have 

\X%AA \ + \X^AA X \ < \X%AA \ + {X^AA^. 

Since for the root r we have |XqAA | + |Xf AAi| < 2k — 2 by Lemma 5.2, in at most 
2k — 2 steps we necessarily arrive to a vertex where the two casses contain no bad edges. 
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At this step Tree outputs the two classes, which is a splitting of H. 

If H is not split, then it is clear that there are bad edges in some class for any partition 
of the vertex set, so Tree will eventually output "No". I 

Lemma 5.4. The algorithm runs in 0(N log N) time. 

Proof. First, Tree runs in linear time, because the size of the underlying tree is a constant 
(determined by k) , and the labels consisting of minimal cover sets can be constructed in a 
number of steps that is linear in the input of Tree, using the method in Lemma 2.1. 

In each iteration step each element of V(G) U E(G) is present in at most one of the 
subhypergraphs that some Tree subroutine receives, and then in each iteration cycle there 
is a linear number of extra steps when the iteration cycle creates what it returns from the 
outputs of the Tree subroutines. This is O(N), and there are 0(log N) iteration steps, so 
we get the claim. I 

Acknowledgements. I am grateful to Peter Hajnal for his support and suggestions. I 
also thank Csaba Biro for his comments on the manuscript. 
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